
################################################
#### Chou, Winston, Kosuke Imai, and Bryn Rosenfeld. 2017. 
#### “Replication Data for: 
#### Sensitive Survey Questions with Auxiliary Information,” 
#### Sociological Methods & Research. Forthcoming.
################################################

####
## Data:
####

The replication archive includes 7 data files:

1. population_data.RData, data from the Mississippi Secretary of State’s voter history file on the population of 2011 General Election voters;
 
2. ms-g11-county-vote-data.RData, official county-level election results from the 2011 Mississippi General Election (sourced from county recapitulation reports);

3. ms-g11-precinct-vote-data.RData, official precinct-level election results from the 2011 Mississippi General Election (sourced from county recapitulation reports);

4. list-data.RData, survey data from the list experiment conducted by Rosenfeld et. al. (2016);

5. rr-data.RData, survey data from the randomized response question conducted by Rosenfeld et. al. (2016);

6. end-data.RData, survey data from the endorsement experiment conducted by Rosenfeld et. al. (2016);

7. dir-data.Rdata, survey data from the direct question in Rosenfeld et. al. (2016).


For further details see “codebook.pdf” and Rosenfeld et. al. 2016. "Replication Data for: An Empirical Validation Study of Popular Survey Methodologies for Sensitive Questions,” The American Journal of Political Science. DOI: 10.7910/DVN/29911. 
 
####
## Documentation:
####

1. This readme.txt file;

2. auxiliary-codebook.pdf, a codebook for the datasets listed above.

####
## Code:
####

The replication archive includes 8 R script files: 

1. aux-fig-1-table-1-table-2-empirical-validation.R reproduces all of the results in Figure 1, Table 1, and Table 2. 

2. aux-fig-2-fig-8-individual.R reproduces the results in Figure 2 and Appendix Figure 8.

3. aux-fig-3-fig-5-simulations.R reproduces the results in Figure 3 and Figure 5.
 
4. aux-fig-4-simulations.R reproduces the results in Figure 4.

5. aux-fig-6-misspecification.R reproduces the results in Figure 6.

6. aux-table-3-direct-comparison.R reproduces the results in Table 3. Note that this script relies on intermediate analysis in aux-fig-4-simulations.R and aux-fig-3-fig-5-simulations.R.

7. aux-table-4-fig-7-covariate.R reproduces the results in Appendix Table 4 and Appendix Figure 7.

8. replication-functions.R includes author-coded functions used in the analysis and is sourced in several of the other scripts.


####
## Software:
####

The methods for incorporating auxiliary information described in our paper can be implemented via the open-source statistical software, endorse: R Package for Analyzing Endorsement Experiments, list: Statistical Methods for the Item Count Technique and List Experiment, and rr: Statistical Methods for the Randomized Response Technique.

The following R package versions available on CRAN were used in the analysis:

list_8.3
rr_1.4
endorse_1.5.0
magic_1.5-6
MASS_7.3-45
arm_1.9-3
coda_0.19-1
lme4_1.1-12
doParallel_1.0.10
foreach_1.4.3
beepr_1.2
xtable_1.8-2
stargazer_5.2 

For compatibility, all of these packages can be installed from source.  
